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Description 

The present invention relates to a high speed 
processor according to the preamble of claim 1 . 

There are at least two different types of mul- 
tiprocessor computer system in the prior art: (1) in 
the first type, each of at least two processors 
executes all of the instructions of one program, and 
operate simultaneously or concurrently for the pur- 
pose of improving the reliability of the processor 
portion of the computer system; and (2) in the 
second type, each of at least two processors ex- 
ecutes different programs or performs different 
tasks or functions. These types of multiprocessor 
computers are described in US-A-4, 152.763 and In 
US-A-4,395.758. 

The performance of a computer system is 
based, in part, upon the speed at which the proces- 
sor or processors within the computer system ex- 
ecute a set of instructions. Generally speaking, the 
set of instructions is executed sequentially, each of 
the instructions being executed at a certain rate. 
Some of the instructions which comprise the set of 
instructions are easy to implement in hardware and 
may be classified as simple instructions. Other 
instructions are more difficult to implement in hard- 
ware and may be classified as complex instruc- 
tions. The simple instructions are more frequently 
executed than are the complex instructions. How- 
ever, in spite of the fact that the simple instructions 
are more frequently executed than are the complex 
instructions, the instructions of the set of instruc- 
tions (including the simple, more frequently ex- 
ecuted instructions) are usually executed sequen- 
tially at the certain rate. Since the performance of 
the computer system may be measured in terms of 
the time elapsed in sequentially executing each of 
the instructions of the set of instructions, the per- 
formance of the computer system could be im- 
proved if the simple, more frequently executed 
instructions within the set of instructions are ex- 
ecuted more rapidly than the complex, less fre- 
quently executed instructions. 

A data processing system which solves the 
problem of efficiently expanding the basic data 
processing system so that it is capable of perform- 
ing new operations and/or the same operations 
more efficiently, as, for example, at higher speed, 
is described in US- A-3 .242,465. This data process- 
ing system has a high speed processor, with, first 
processing means for rapidly executing a first sub- 
set of instructions, and second processing means 
which is the basic system executing the remaining 
subset. However, this prior art does not show first 
means for storing an instruction as being located in 
the first (added) processing system. The first 
means for storing an instruction is instead located 
in the second (basic) processing means. 



Accordingly, it is the object of the present 
invention to improve the performance of a com- 
puter, system by executing the simple, more fre- 
quently executed instructions of a set of instruc- 

5 tions of one program more rapidly than the com- 
plex, less frequently executed instructions of the 
one program; and to execute the simple, more 
frequently executed instructions more rapidly than 
the complex, less frequently executed instructions 

10 by utilizing a special high speed processor in addi- 
tion to a main processor, the high speed processor 
executing the simple, more frequently executed 
instructions of the one program and the' main pro- 
cessor executing the complex, less frequently ex- 

75 ecuted instructions of the one program, the rate at 
which the high speed processor executes an in- 
struction being greater than the rate at which the 
main processor executes an instruction. 

The solution is described in the characterizing 

20 part of claim 1. The multiprocessor system of the 
present invention is classified within a third type of 
multiprocessor system as contrasted v;ith the first 
type and the second type of multiprocessor system 
discussed in the description of the prior art section 

25 of this application. In the third type, one program is 
executed by both of the processors of the mul- 
tiprocessor system; however, each processor ex- 
ecutes a different subset of instructions of the one 
program. The processors execute their instmctions 

30 of the one program non-concunrently, that is. when 
one processor is executing an instruction, the op- 
eration of the other processor is suspended. One 
processor executes selected, specific instructions 
of the one program for the purpose of improving 

35 the performance of the computer system. The se- 
lected, specific instructions of the one program, 
executed by the one processor, are "simple" and 
"more frequently executed" than are the other 
"complex" instructions of the one program. The 

40 one processor is also designed to execute the 
selected instructions very rapidly for the purpose of 
improving the performance of the computer sys- 
tem. This particular processor is termed a "high 
speed processor". 

45 The rapid execution of the selected "simple" 
and "more frequently executed" instructions is ac- 
complished by overlapping the execution of two 
sequential instructions. That is. while a simple in- 
struction is being executed, the next instruction in 

50 the sequence is being prepared for execution. 
When the high speed processor is not executing a 
simple instruction, the other processor, the main 
processor of the computer system, is executing the 
complex, less frequently executed instructions. 

55 Therefore, the Instruction processing time, asso- 
ciated with the execution of the set of instructions 
of the one program, is decreased by a factor of 
approximately 40 %. 
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A full understanding of the present invention 
will be obtained from a reading of the detailed 
description given hereinbeiow and the accompany- 
ing drawings, which are given by way of illustration 
only, and thus are not limitative of the present 
invention, and wherein: 

Rgure 1 illustrates a high speed processor 
according to the present invention, 
connected to a conventional data 
processing system, for executing 
the more frequently executed 
instructions of a set of instructions 
to be executed by the data pro- 
cessing system; 
Figure 2 illustrates a detailed construction of 
the high speed processor accord- 
ing to the present invention: 
Figure 3 illustrates a detailed construction of 
the instruction decode and control 
circuit 12c shown in Rgure 2; 
Figure 4 illustrates a detailed construction of 
the general purpose registers 1 2d 
shown in Rgure 2; 
Rgure 5 illustrates a chart including several 
example instructions and a percent 
mix figure associated with each in- 
struction; and 
Figure 6 illustrates a detailed construction of 
the opcode decode control circuit 
12c1 shown in Rgure 3. 
Referring to Rgure 1 , a conventional data pro- 
cessing system 10 is illustrated including a mem- 
ory 10a, a main processor 10b connected to the 
memory 10a via lines 10e, 10f, and lOg and a 
control storage 10c representing a Read Only 
fvtemory (ROM) connected to the main processor 
10b. The memory 10a includes a main storage 
lOal and a cache subsystem 10a2 connected to 
the main storage 10a1. Line lOe interconnects the 
cache subsystem 10a2 to the main processor 10b 
and is utilized for the transfer of data. Lines 10f/10g 
are utilized for the transfer of instructions. The 
main processor 1 0b is connected to various periph- 
eral devices via an input/output line lOd. Details 
with regard to the construction of the data process- 
ing system may be found In a technical manual 
directed to the construction and the principles of 
operation of the IBM 370 computer, this manual 
being identified by the number '•GA22-7000-9'*, the 
disclosure of which is incorporated by reference 
into the specification of this application. 

In accordance with the present invention, a 
high speed processor 12 is interconnected be- 
tween memory 10a and main processor 10b via 
lines lOf and lOg. The high speed processor 12 
receives instructions from memory 10a. Of those 
instructions received by the high speed processor 
12, some Instructions are more frequently executed 



than others. The high speed processor 12 deter- 
mines if a received instruction falls within the 
"more frequently executed" category. If the re- 
ceived instruction does fall within this category, the 
5 main processor 10b is prevented from executing 
the instruction, and the high speed processor 12 
executes the instruction. However, if the instruction 
does not fall within the "more frequently executed" 
category, the high speed processor 12 signals the 

10 main processor 10b. AS a result, the main proces- 
sor 10b Is permitted to execute the instruction. In 
this case, the operation of the high speed proces- 
sor 12 is suspended. Since the high speed proces- 
sor 12 is specifically designed to execute certain 

75 selected instructions which fall within the "more 
frequently executed" category, the speed at which 
these selected instructions are executed by the 
high speed processor 12 is much greater than the 
speed at which the other remaining instructions are 

20 executed by the main processor. Therefore, the 
performance of the computer system of the present 
invention, utilizing the high speed processor, is 
optimized and/or increased relative to the computer 
system of the prior art. wherein the high speed 

25 processor 12 is not utilized. 

Referring to Rgure 2, a detailed construction of 
the high speed processor 12 according to the 
present invention is illustrated. In Rgure 2, the high 
speed processor 12 comprises an instruction buffer 

30 12a connected to memory 10a for temporarily stor- 
ing an instruction to be executed. An instruction 
register 12b is connected to the instruction buffer 
12a for receiving the instruction and executing the 
instruction. An output of the instruction register 1 2b 

35 is connected to the main processor 10b for trans- 
ferring the instruction to the main processor 10b. 
The main processor 10b will execute the instruction 
in the event the instruction cannot be executed by 
the high speed processor 12. The output of the 

40 instruction register 12b is also connected to an 
instruction decode and control circuit 12c for de- 
coding the instruction being executed in the in- 
staiction register 12b. The main processor 10b is 
connected to the instruction decode and control 

46 circuit 12c for supplying address information asso- 
ciated with locations within the general purpose 
registers 12d in the event the main processor de- 
sires to retrieve one or more operands from the 
general purpose registers 12d for the purpose of 

50 executing the current instruction. The instruction 
decode and control unit 12c Is connected to the 
main processor 10b via a line labelled "signal main 
processor" for the purpose of signalling the main 
processor to either execute the instruction or not 

55 execute the instruction. If the "signal main proces- 
sor" line is high (binary 1), the main processor 10b 
will execute the instruction. However, if the "signal 
main processor" line is low (binary 0), the main 
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processor will be prevented from executing the 
instruction. If the main processor 10b fails to ex- 
ecute the instruction, the high speed processor 12 
will execute the instruction. 

The instruction decode and control circuit 12c 
is connected to a set of general purpose registers 
I2d via line 12e. The general purpose registers 
12d are connected to an arithmetic logic unit (ALU) 
12f via an A-reglster 12g. The general purpose 
registers 1 2d are also connected to the ALU 1 2f via 
a B-register 12h. An output of the A-register 12g is 
connected to an input of the instruction decode and 
control circuit 12c. The output of the A-register 12g 
is also connected to an input of the main processor 
10b for transferring the data (such as operands) 
stored in the general purpose registers 12d to the 
main processor 10b in the event the main proces- 
sor is executing the instruction. The output of the 
instruction register 12b is also connected to a 
displacement register 12S for storing a displace- 
ment value therein. The output of the displacement 
register 12S is connected to the ALU 12f via line 
12M. An output of the ALU 12f is connected to an 
input of a D-register 12L An output of the instmc- 
tion decode and control circuit 12c is further con- 
nected to the input of the D-register 121. An output 
of the D-register 121 is connected to an input of the 
A-register 12g. the B-register 12h. and the general 
purpose registers 12d. An output of the D-register 
12i is also connected to an input of the memory 
10a. 

The memory 10a is connected to an input of a 
data register 12j. An output of the D-register 12i is 
also connected to another input of the data register 
12j. An output of the general purpose registers 12d, 
which is connected to the A-register 12g. is con- 
nected to another input of the data register I2j via 
line 12Q. An output of the general purpose regis- 
ters 12d, which is connected to the B-register 12h, 
is connected to another input of the data register 
12j via line 12R. An output of the data register 12j 
is connected to a byte shifter 12k for shifting the 
incoming bytes of data a certain amount in order to 
properly align the first bit of the data byte with a 
correct first bit position for the byte of data. The 
byte shifter 12k is connected to an input of the B- 
register 12h. The byte shifter 12k is also connected 
to an input of an OR gate 12L and to memory 10a. 
The output of the general purpose registers 12d, 
which is connected to the A-register 12g, is also 
connected to another input of the OR gate 12L. An 
output of OR gate 12L is connected to another 
input of the instruction register 12b. 

Referring to Figure 3, a detailed construction of 
the instruction decode and control circuit 12c is 
illustrated. In Rgure 3, the instruction decode and 
control circuit 12c comprises an opcode decode 
control circuit 12c1 connected to the instruction 



register 12b for decoding the opcode of the in- 
struction residing in the instruction register 12b to 
determine the type of instruction, for example, an 
EXECUTE instruction. The output of the opcode 
5 decode and control circuit 12c1 is connected to the 
main processor 10b. If the opcode decode control 
circuit 12c1 determines, from the instruction op- 
code, that the instruction fails within the "more 
frequently executed" type, the output signal gen- 

10 erated from the opcode decode and control circuit 
12c1, labelled "signal main processor", will be low 
(i.e. - a binary 0). In response to this low output 
signal, the main processor 10b will not execute the 
instruction. However, the high speed processor 12 

IS will execute the instruction. If the opcode decode 
control circuit 12c1 determines that the instruction 
does not fall within the "more frequently executed" 
type, the output signal generated from the opcode 
decode and control circuit 12cl will be high (a 

20 binary 1). In response to this high output signal, the 
main processor 10b will execute the instruction and 
the high speed processor suspends its operation. 

The instruction decode and control circuit 12c 
further comprises a GPR A address control 12c2 

25 and a GPR B address control 12c3 connected to 
the instruction register 12b for receiving and stor- 
ing the address information from the instruction 
stored in the instruction register 12b. An output 
from the GPR A address control 12c2 and an 

30 output from the GPR B address control 12c3 col- 
lectively constitute line 12e. line 12e being con- 
nected to the general purpose registers 12d. The 
instruction register 12b is also connected to a GPR 
address compare circuit 12c4. An output of the 

35 GPR A address control circuit I2c2 is also con- 
nected to the GPR address compare circuit 12c4. 
The GPR address compare circuit 12c4 compares 
the incoming address, destined for storage in the 
GPR A address control 12c2. with the address 

40 being generated from the GPR A address control 
circuit 12c2, and suspends the execution of the 
next instruction if the compared addresses are the 
same. Alternatively, the address compare circuit 
12c4 may wrap the result from ALU 12f back into 

45 the B-register 12h in order to allow further process- 
ing on said result. In view of the overlapped execu- 
tion of two sequential instructions mentioned pre- 
viously, if the incoming address from the next 
instnjction to be executed is the same as the 

50 address being generated from the GPR A address 
control circuit 12c2, associated with the current 
instruction being executed, the GPR address com- 
pare circuit 12c4 generates a disable overlap sig- 
nal. The disable overlap signal (1) prevents the 

55 instruction decode and control circuit 12c from 
addressing the general purpose registers 12d via 
line 12e, (2) prevents the A-register 12g and the B- 
register 12h from being enabled, and (3) prevents 
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the next instruction from being pre-decoded by the 
. opcode decode control circuit 12c1. As a result, 
while the ALU 12f is performing a processing op- 
eration on the operands previously withdrawn from 
the general purpose registers 12d, the disable 
overlap signal generated from the address com- 
pare circuit 12c4 will prevent the operands asso- 
ciated with next instruction from being withdrawn 
from the general purpose registers 12d. 

The output of the GPR A address control cir- 
cuit 12c2 is also connected to one input of an 
instruction retry buffer 12c5. The. output from the 
A-register 12g is connected to another input of the 
instruction retry buffer 12c5. An output from the 
instruction retry buffer 12c5 is connected to the 
input of the D-register 12i. When the instruction 
being executed directs the operation of two 
operands on one another, one of the two operands 
is stored in the instruction retry buffer 12c5 prior to 
execution of the instruction. If a machine check 
occurs, the instruction retry buffer 12c5 has the 
one operand in storage in the event the instruction 
should be re-executed. As will be indicated below, 
the one operand must be stored in the instruction 
retry buffer 12c5 prior to execution of the instruc- 
tion. The other operand is not destroyed as a result 
of execution of the instruction, and. therefore, need 
not be pre-stored. 

Referring to Rgure 4, a detailed construction of 
the general purpose registers 12d of Rgure 2 is 
illustrated. In Rgure 4. the general purpose regis- 
ters 12d comprise a general purpose A group of 
registers 12d1 (GPR A) and a general purpose B 
group of registers 12d2 (GPR B). It should be 
noted that the contents of the GPR B group 12d2 is 
an exact copy of the contents of the GPR A group 
I2d1. The output from the D-register 121 is con- 
nected to each of the GPR A group and the GPR B 
group, 12d1 and 12d2, respectively. The GPR A 
address control I2c2 is connected to the GPR A 
group 12d1 via line 12e. The GPR B address 
control 12c3 is connected to the GPR B group 
12d2 via line 12e. The GPR A address control 12c2 
and the GPR B address control 12c3 are address 
registers for addressing locations within the GPR A 
group 12d1 and the GPR B group 12d2, respec- 
tively. The GPR A group 12d1 is connected to the 
input of A-register 12g. The GPR B group 12d2 is 
connected to the input of B-register 12h. Each of 
the outputs from GPR A group and GPR B group 
are connected to the input of the data register 12j. 

Refemng to Rgure 5. a chart 20 is illustrated, 
the chart including several example "simple" and 
"more frequently executed" instructions and includ- 
ing several example "complex" and "less frequent- 
ly executed" instructions. In Rgure 5, the chart is 
subdivided Into three columns. A first column 20a 
includes a plurality of instruction acronyms. A sec- 



ond column 20b includes a plurality of instruction 
opcodes, an opcode corresponding to each instruc- 
tion acronym of column 20a. A third column 20c 
includes a plurality of percent mix figures, a mix 
5 figure conresponding to each instruction acronym of 
column 20a. An instruction acronym identifies a 
particular instruction. Since each instruction pos- 
sesses a specific opcode, an instruction opcode 
also identifies a particular instruction. Therefore, 

70 the first and second columns 20a and 20b of chart 
20 include instruction acronyms and opcodes for 
identifying specific instructions. Each instruction, 
on the average, appears in a set of instructions a 
certain plurality of times. When a figure represent- 

75 ing the certain plurality is divided by the total 
number of instructions in the set of instructions, a 
percent mix figure is the result. Therefore, the third 
column 20c of chart 20 includes a plurality of mix 
figures, each mix figure representing a percentage 

20 of the total number of instructions within the set of 
instructions. In Rgure 5, note that instructions 
"BC", "L", and "ST" represent approximately 19 
%, 13 %, and 6 % of the total number of instruc- 
tions within a set of instnjctions. However, note that 

25 instructions "XC". "MVGL". and "MVC" represent 
approximately 0.5 %, 0.08 %, and 1,5 % of the 
total number of instructions within the set of 
instructions. Based on these percentage mix fig- 
ures, instructions "BC", "L", and "ST" represent 

30 simple, more frequently executed instructions 
whereas instructions "XC". "MVCL". and "MVC" 
represent complex, less frequently executed 
instructions. 

Referring to Figure 6, a detailed construction of 

35 at least a portion of the opcode decode control 
circuit 12c1 of Rgure 3 is illustrated. In Rgure 6. 
the opcode decode control circuit t2c1 includes a 
NOR gate 12c1(a). The input terminals of tiie NOR 
gate 12c1{a) are connected to tiie instruction regis- 

40 ter 12b via a plurality of opcode decoders 12c1(c). 
An opcode decoder 12c1(c) is connected to each 
of the input temriinals of the NOR gate I2cl(a). 
Each of the opcode decoders 12c1(c) are designed 
to decode and respond to a different instruction 

45 opcode. For example, if the instruction is a "branch 
on condition (BC)" instruction, the opcode of the 
instruction is "47". Thetrefore. as shown In Rgure 
6, the first opcode decoder 12c1(c). connected to 
the first input terminal of the NOR gate 12c1(a). is 

50 designed to decode and respond to tiiis instruction, 
the first opcode decoder generating an output sig- 
nal. However, the other opcode decoders 12c1(c) 
fail to decode and respond to this opcode, since 
they are designed to decode and respond to dif- 

55 ferent Instiiiction opcodes. The output temiinai of 
Uie NOR gate 12c1(a) is connected to the main 
processor 10b and generates the "signal main pro- 
cessor" output signal as shown in Rgures 2 and 3. 
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The NOR gate 12c1(a) comprises an OR gate 
12cl(a) (1) and an inverter 12c1(a) (2) connected to 
the output of the OR gate 12c1(a) (1). 

The functional operation of the data processing 
system 10. in combination with the high speed 
processor 12. will be described in the paragraphs 
below with reference to Rgure 1 of the drawings. 

An instruction is withdrawn from memory 10a 
and is received by the high speed processor 12. If 
the instruction is one of a set of "more frequently 
executed" instructions, the high speed processor 
12 executes the instruction. The main processor 
10b is prevented from executing the instruction. 
However, if the instruction is not one of the set of 
"more frequently executed" instructions, the main 
processor 10b is not prevented from executing the 
instruction. In this case, the main processor 10b 
executes the instruction and the high speed pro- 
cessor 12 suspends its operation until the main 
processor 10b completes execution of the instruc- 
tion. The control storage IQc is a Read Only Mem- 
ory containing certain specific microcode instmc- 
tions stored therein which are utilized to assist the 
main processor in the execution of an instmction. 
When the main processor 10b is permitted to ex- 
ecute the instruction, the main processor 10b ex- 
ecutes the instruction in accordance with the micro- 
code instructions stored within the control storage 
10c. Since a large proportion of the instructions 
withdrawn from memory 10a fall within the "more 
frequently executed" category, these instructions 
are executed by the high speed processor 12. 

Furthemnore, when the high speed processor 
12 executes the instructions, it is operating in an 
overlap execution mode. During the overlap execu- 
tion mode, while the high speed processor 12 is 
executing a first instruction, it is preparing to com- 
mence execution of a second instruction in a se- 
quence of instructions. As a result, due to its ex- 
ecution of the selected instructions in the overlap 
execution mode, the high speed processor 12 is 
capable of executing the "more frequently execut- 
ed" instructions more rapidly than would be possi- 
ble if the main processor 10b were to execute the 
selected "more frequently executed" instructions. 

As a result, the instruction processing time, 
associated with the execution of the instructions 
withdrawn from memory 10a. is decreased by a 
factor of approximately forty percent (40 %). 
Therefore, the performance of the computer sys- 
tem of the present invention is improved relative to 
the computer systems of the prior art. 

The functional operation of the high speed pro- 
cessor 12. illustrated in Rgure 1. will be described 
in the paragraphs below witii reference to Rgures 2 
through 4 of the drawings. 

An instruction is withdrawn from memory 10a 
and is received by the high speed processor 12. 



The instruction is temporarily stored in the instruc- 
tion buffer 12a and is considered the "next instruc- 
tion to be executed". The instruction is subse- 
quently transmitted to and stored in tiie instruction 

5 register 12b for execution. When transmission of 
the instmction to the instruction register 12b is 
complete, prior to the execution of Uie instruction, 
the opcode decode and control circuit 12c1 of the 
instruction decode and control circuit 12c decodes 

70 the instruction for the purpose of determining if 
said instruction falls within the "more frequentiy 
executed" category. If the instruction does not fall 
witiiin tiiis category, the opcode decode control 
circuit 12c1 generates a high output signal, this. 

16 high output signal being the "signal main proces- 
sor" signai. In response to this high output signal, 
the main processor 10b executes tiie instruction. In 
this case, the operation of the high speed proces- 
sor 12 is suspended. However, if the opcode de- 

20 code and control circuit 12c1 detennnines that the 
instruction does fall witiiln the "more frequentiy 
executed" category, the "signal main processor" 
output signal from the opcode decode control cir- 
cuit 12cl remains low. Therefore, the main proces- 

25 sor 10b is prevented from executing this instruc- 
tion. 

Recall that the insti'uction address information 
associated with a first operand and further instruc- 
tion address information associated with a second 

30 operand is stored in tiie GPR A address control 
12c2 and the GPR B address control 12c3, respec- 
tively. Assume tiiat tiie instruction stored in the 
instruction register I2b is an ADD instruction and 
that two operands are to be added together, the 

35 sum of the two operands being stored in the gen- 
era! purpose registers 12d. Assume further that 
both operands are cunrently stored in tiie general 
purpose registers 12d, one operand being stored in 
the GPR A group 12d1 and the other operand 

40 being stored in the GPR B group 12d2. Recall that 
the contents of the GPR A group 12d1 is an exact 
copy of the contents of tiie GPR B group 12d2. 
The address information stored in the GPR A ad- 
dress control 12c2 is used to address the GPR A 

45 group 12d1, and tiie address information stored in 
the GPR B address control 12c3 is used to address 
the GPR B group 12d2. In fact, groups 12d1 and 
12d2 are simultaneously addressed by the address 
controls 12c2 and 12c3, respectively. As a result, 

50 the time consumed with respect to the addressing 
of the general purpose registers 12d is conserved. 
The conservation of tiiis time assists in the optimiz- 
ation of the performance of tiie computer system. 
When groups I2d1 and 12d2 are addressed. 

55 tiie operands stored tiierein are simultaneously 
witiidrawn and stored in the A-register 12g and tiie 
B-register 12h, respectively. The arithmetic logic 
unit (ALU) 12f retiieves the operands from the A- 
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register and the B-register and performs the in- 
struction operation on the two operands, that is, 
adds them together forming a sum of the two 
operands. The sum is stored in the QPR A group 
12d1 and In the GPR B group 12d2 within the 
general purpose registers 12d via the D-register 
121. 

While the ALU I2f is performing the instruction 
operation on the two operands, the next Instruction 
to be executed in the sequence of Instructions is 
stored in the instruction register 12b and is being 
pre-decoded by the opcode decode control circuit 
12c1 to determine If the next instruction falls within 
the "more frequently executed" category. Conse- 
quently, the high speed processor 12 is operating 
in the overlap execution mode in that the execution 
of the current instruction is overlapped with the 
preparation for execution of the next instruction in 
the sequence. As a result of this overlapped execu- 
tion, the high speed processor 12 Is capable of 
executing the current instruction more rapidly than 
would be possible if the main processor 10b were 
to execute the current Instruction. However, as will 
be noted below, the address compare circuit 12c4 
ensures that execution of the current instruction by 
the ALU 12f is complete before executing the next 
instruction in the sequence, that is. before the 
addressing of the general purpose registers 12d, 
associated with the execution of the next instruc- 
tion, commences. 

Assume that one operand is stored in the GPR 
8 group 12d2 of the general purpose registers 12d 
and the other operand is stored in the memory 
10a. As before, assume that an ADD instruction is 
stored in the Instruction register 12b, whereby the 
one operand is to be added to the other operand, 
and the sum of the two operands is to be stored in 
the QPR B group 12d2 of the general purpose 
registers 12d. Before the operands may be added 
together, the memory 10a must be addressed for 
the purpose of withdrawing the other operand from 
the memory 10a. Therefore, when the opcode de- 
code and control circuit 12c1 within the instruction 
decode and control circuit 12c decodes the instruc- 
tion stored in the next instruction buffer 12b1 and 
determines that the instruction fails within the 
"more frequently executed" category, the "signal 
main processor" line is low, and the GPR A ad- 
dress control 12c2 and the GPR B address control 
I2c3, within the instruction decode and control 
circuit 12c, address the general purpose registers 
12d and, in particular, the GPR A group 12d1 and 
the GPR B group 12d2. When the groups 12d1 and 
12d2 are addressed by the GPR A address control 
12c2 and the GPR B address control 12c3. respec- 
tively, a binary number is read from the GPR A 
group 12d1 and another binary number is read 
from the GPR B group 12d2, the binary numt>ers 



being stored In the A-register 12g and the B- 
register 12h. Further, the instruction stored in the 
instruction register 12b includes a displacement 
value. The instruction displacement -value is stored 
5 in the displacement register 12S. A signal from the 
displacement register 12S, representing the dis- 
placement value, energizes the ALU 12f via line 
12M. Signals from the A-reglster 12g and the B- 
register 12h, representing the binary numbers 

70 stored therein, energize the ALU 12f via lines 12N 
and 12P. The ALU 12f adds a subtotal, represent- 
ing the sum of the binary numbers, to the displace- 
ment value thereby forming a sumtotal, the sum- 
total being stored within the D-register 121. The D- 

75 register 12i generates an address representing the 
sumtotal, the address energizing the memory 10a 
and addressing the memory 10a at a location cor- 
responding to the other operand. The other 
operand is stored in the B-register 12h via the data 

20 register i2j and the byte shifter 12k. The byte 
shifter 12k aligns the received bytes of the other 
operand such that the first bit of the first byte of 
the other operand is aligned with a connect first bit 
position within the B-register I2h. The address 

25 control circuit 12c2 within the instruction decode 
and control circuit 12c addresses the GPR A group 
12d1 within the general purpose register 12d and 
thereby reads the one operand therefrom for stor- 
age within the A-register 12g. The ALU 12f adds 

30 the one operand, stored in the A-register 12g. to 
the other operand, stored in the B-register 1 2h. The 
sum of the two operands is stored in the GPR B 
group 12d2 via the D-register 12i. 

Assume that the instruction stored in the in- 

35 struction register 12b is an EXECUTE instruction. 
This particular execute instruction comprises the 
following characteristic fields: 
EX..MODIFY..SUBJECT INSTRUCTION, where EX 
indicates that the instruction is an execute instruc- 

40 tion. MODIFY is a modify field, and SUBJECT 
INSTRUCTION indicates a subject instruction 
stored in memory 10a and includes a displacement 
value and address information relative to the con- 
tents of tiie QPR A group 12d1 and the GPR B 

45 group 12d2 disposed within the general purpose 
registers 12d. The displacement value, stored with- 
in the displacement register 12S, energizes tiie 
ALU 12f via line 12M. A signal originating from the 
GPR A address control 12c2 and from the GPR B 

50 address control 12c3, representing the address in- 
formation, energizes tine GPR A group 12d1 and 
the GPR B group 12d2 of the general purpose 
registers 12d via line 12e. The contents of the GPR 
A group and the GPR B group, disposed within the 

55 general purpose register 12d. energize the ALU 12f 
via lines 12N and 12P. The ALU 12f adds the 
displacement value to the contents of the GPR A . 
group and the QPR B group fonming another sum. 
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this other sum being temporarily stored in the D- 
register 12i and addressing the memory lOa. The 
subject instruction is withdrawn from the memory 
. 10a and temporarily stored in the dat^ register 12j. 
The bytes of the subject instruction are shifted as 
needed in the byte shifter 12k and energize one 
input of the OR gate 12L 

The modify field (MODIFY) of the execute in- 
stnjction addresses the GPR A group within the 
general purpose registers 12d via line 12e. Specific 
bytes of data are read from the GPR A group 12d1 
within the general purpose register 12d in response 
to the addressing by the modify field data. The 
specific bytes of data energize the other input of 
the OR gate 12L. As a result the bytes of the 
subject instruction, energizing one input of the OR 
gate 12L. are modified in accordance with the 
specific bytes of data energizing the other input of 
the OR gate 12L. A signal is generated from the 
OR gate 12L representing a modified subject in- 
struction, the modified subject instruction being 
stored within the instruction register 12b for execu- 
tion thereof. 

If. during operation in the "overiap execution 
mode", the high speed processor 12 begins to 
address the general purpose registers 12d for the 
purpose of executing the next instruction prior to 
the completion of the execution of the current in- 
struction by the ALU 12f, the address compare 
circuit I2c4 will suspend execution of the next 
instruction. The execution of the next instruction is 
suspended in the following manner. The GPR ad- 
dress compare circuit 12c4 compares the address 
of next instruction to be executed with the address 
of the current instruction being executed, appearing 
on the output of the GPR address control circuit 
I2c2. If the two addresses are the same, the ad- 
dress compare circuit 12c4 generates the disable 
overlap signal. The disable overlap signal (1) pre- 
vents the instruction decode and control circuit I2c 
from addressing the general purpose registers 12d 
via line 12e, (2) prevents the A-register 12g and the 
B-register 12h fronni being enabled, and (3) pre- 
vents the next instruction from being pre-decoded 
by the opcode decode control circuit 12cl. As a 
result, in tiiis instance, tine address compare circuit 
12c4 suspends the execution of the next instruction 
until the execution of tiie current instruction is 
complete. 

When the address control circuit 12c2 gen- 
erates a signal representative of a location within 
the GPR A group 12d1, the location address is 
stored in the iristruction retry buffer 12c5. A first 
operand corresponding to the contents of the loca- 
tion address is also stored in tiie instruction retry 
buffer via the output from the A-register 12g. In the 
event the instiuction should be re-executed, tiiat is, 
when a machine check should occur, the first 



operand is stored in the instruction retry buffer 
12c5 and is available for use. It is not stored in the 
GPR A group 12d1 since the location in A-group 
12d1 and B-group 12d2 corresponding to the first 

5 operand is overlayed by a result when execution of 
the instruction Is complete. The second operand is 
still stored in the GPR A group 12d1 and the GPR 
B group 12d2 and is available for use since the 
contents of this group is not overlayed by a result 

10 when execution of the instruction is complete. 

The contents of the GPR A group 12dl and the 
GPR B group 12d2 are stored in memory I0a via 
lines 12Q and 12R, the data register 12], and the 
shifter12K. 

15 The functional operation of the opcode decode 
control circuit 12c1 of Figure 3 will be described in 
the following paragraphs with reference to Figures 
5 and 6 of the drawings. 

In Rgure 6. the opcode decoders 12cl(c) of 

20 NOR gate 12c1(a) are designed to decode and 
respond to the instructions shown in Figure 5 iden- 
tified by acronyms "BG". "L", and "ST" and by 
opcodes "47", "58". and "50". For example, if tiie 
instruction stored in the instruction register 12b is a 

25 "BC" instruction having opcode "47", only the first 
opcode decoder 12c1{c). connected to NOR gate 
12c1(a) input terminal, will respond. The first op- 
code decoder 12c1(c), connected to the input ter- 
minal of tiie NOR gate 12c1(a) labelled 47, will 

30 generate an output signal. However, the other op- 
code decoders will fail to generate an output signal. 
Assuming that this output signal is a high (binary 1 ) 
signal, the NOR gate 12c1(a) generates a iow 
(binary 0) "signal main processor" output signal. 

35 As mentioned above, the "BC" instruction is a 
"simple, more frequently executed" instruction. 
Since the "signal main processor" output signal is 
low. the main processor 10b is prevented from 
executing tiie "BC" instmction. As a result, the 

40 high speed processor of the present invention ex- 
ecutes the instruction. However, if the instruction 
stored in tiie instruction register 1 2b is an "XC" 
instruction having opcode "D7". none of the op- 
code decoders 12c1(c) respond. Therefore, none of 

45 the opcode decoders 12c1(c) generate an output 
signal. Each of the input terminals of the NOR gate 
12c1(a) are low (binary 0). Due to the operation of 
inverter 12c1 (a)(2), the output signal from the NOR 
gate 12c1(a) is high (binary 1)..This output signal 

50 represents the "signal main processor" output sig- 
nal illustrated in Figures 2 and 3. Since tiie "signal 
main processor" output signal is high, tiie main 
processor 10b executes the "XC" instruction. As 
mentioned above, the "XC" instruction is a 

56 "complex, less frequently executed" instruction. 
Since the "signal main processor" output signal is 
high, the main processor 10b executes the "XC" 
Instruction since it is not prevented from executing 
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the "XC instruction. 



Claims 

1. High speed processor for use in a connputer 
system, said high speed processor executing a 
set of instructions, a first subset of said set of 
instructions being easier to implement in hard- 
ware and being more frequently executed than 
the remaining subset of said set of instructions, 
with first processing means (12) for rapidly 
executing said first subset of said set of 
instructions; and with second processing 
means (10b) executing said remaining subset 
of said set of instructions, characterized 



eration on said one operand and a second 
operand and for generating an output signal 
indicative of the results of said arithmetic op- 
eration. 

5 

3. High speed processor of claim 1 » characterized 
in 

that said first processing means (12) operates 
10 in an overlap execution mode, the decode con- 

trol means (12c) of said first processing means 
(12) pre-decoding a next instruction stored in 
said first means when said arithmetic logic unit 
(12f) of said first processing means (12) is 
15 performing an arithmetic operation on the 

operands associated with a current instruction 
being executed. 



by first means (12a, 12b). located in the first 
processing means (12), for storing an instruc- 
tion; 



4. High speed processor of claims 1 and 3, char- 
20 acterized in that said decode control means 
(1 2c) further comprises: 



by decode control means (12c) connected to 
said first means (12b) and to said second 
processing means (10b) for decoding said In- 2S 
struction stored in said first means and for 
determining if said instruction falls within the 
first subset of said set of instructions or within 
the remaining subset of said set of instructions, 
said decode control means (12c) developing a 3o 
first output signal when said instruction falls 
within the remaining subset of said set of 
instructions and developing a second output 
signal when said instruction falls within the first 
subset of said set of instructions, said second 35 
processing means (10b) executing said instruc- 
tion in response to said first output signal from 
said decode control means (12c). said second 
processing means (10b) being prevented from 
executing said instruction In response to said 40 
second output signal from said decode control . 
means (12c). 

2, High speed processor of claim 1 . characterized 

in that said first processing means (12) further 45 
comprises: 

general purpose registers (12d) connected to 
said decode control means (12c) for storing at 
least one operand therein and for generating so 
an output signal indicative of the one operand 
in response to said second output signal from 
said decode control means {12c); 

an arithmetic logic unit (I2f) connected to said 55 
general purpose registers (12d) and responsive 
to the output signals from said general purpose 
registers (12d) for perfonming an arithmetic op- 



address control means {12c2; 12c3) connected 
to said second processing means (10b) for 
receiving address information from the instruc- 
tion stored in said first means (12a, 12b) and 
for generating said second output signal when 
the instruction falls within the first subset of 
said set of instructions; 

address compare means (12c4) connected to 
an input of said address control means (12c3) 
and to an output of said address control means 
(12c2) for comparing said second output signal 
generated from said address control means 
with the address information being received by 
said address control means and generating a 
disable overlap signal when the address in- 
formation associated with said second output 
signal is the same as the address information 
being received by said address control means, 
said disable overiap signal preventing the de- 
code control means from pre-decoding the 
next instruction stored in said first means 
(12b); and 

a special retry buffer (12c5) for repeating the 
execution of an instruction. 



Revendlcations 

1. Processeur ultra-rapide pour utilisation dans un 
syst^me d'ordinateur. ledit processeur ultra ra- 
plde executant un jeu d'instructions, un pre- 
mier sous-jeu dudit jeu d'instructions etant fa- 
cile k mettre en oeuvre en materiel et ^tant 
plus frequemment ex6cut§ que le sous-jeu res- 
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tant dudit jeu dMnstmctions. avec un premier 
moyen de traitement (12) pour executer rapi- 
dement ledit premier sous-jeu dudit jeu d'ins- 
tructions; et avec un second moyen de traite- 
ment (10b) executant ledit sous-jeu restant du- s 
dit jeu dMnstructions caracterise 

- par un premier moyen (12a, 12b) situi 
dans le premier moyen de traitement 
(12), pour memoriser une instruction; 

- par un moyen de commande de decoda- io 
ge (12c) relie audit premier moyen (12b) 

et audit second moyen de traitement 
(10b) pour decoder ladite instruction me- 
moris^e dans ledit premier moyen et 
pour determiner si ladite instruction tom- 75 
be a I'interieur du premier sous-jeu dudit 
jeu d'instructions ou a I'interieur du sous- 
jeu restant dudit jeu d instructions, ledit 
moyen de commande de decodage (12c) 
elaborant un premier signal de sortie 20 
lorsque ladite instruction tombe a I'inte- 
rieur du sous-jeu restant dudit jeu d'ins- 
tructions et elaborant un second signal 
de sortie lorsque ladite instruction tombe 
a rinterieur du premier sous-jeu dudit jeu 25 
d'instructions, ledit second moyen de 
traitement (10b) executant ladite instruc- 
tion en reponse audit premier signal de 
sortie depuis ledit moyen de commande 
de decodage (12c). ledit second moyen 30 
de traitement (1 Ob) etant empeche d'exe- 
cuter ladite instruction en reponse audit 
second signal de sortie depuis ledit 
moyen de commande de decodage 
(12c). 35 

Processeur ultra-raplde selon la revendication 
1. caracterise en ce que ledit premier moyen 
de traitement (12) comprend de plus : 

- des registres ^ usage general (12d) relie 40 
audit moyen de commande de decodage 
(12c) pour y memoriser au moins un 
operande et pour produire un signal de 
sortie indicatif du premier operande en 
reponse audit second signal de sortie 45 
depuis ledit moyen de commande de 
decodage (12c); 

- une unit^ logique arithmetique (12f) re- 
liee aux dits registres a usage general 
(12d) et sensible aux signaux de sortie 50 
depuis lesdits registres a usage general 
(12d) pour effectuer une operation arth- 
metique sur ledit premier operande et un 
second operande et pour produire un 
signal de sortie indicatif des r^sultats de 55 
ladite operation arithmetique. 

Processeur ultra-raptde selon la revendication 



1, caracterise en ce que ledit premier moyen 
de traitement (12) fonctionne dans un mode 
d'execution en chevauchement, le moyen de 
commande de decodage (12c) dudit premier 
moyen de traitement (12) predecodant une 
prochaine instruction memorisee dans ledit 
premier moyen lorsque ladite unite iogique 
arithmetique (12f) dudit premier moyen de trai- 
tement (12) est en cours d'effectuer une ope- 
ration arithmetique sur les operandes associes 
h une instruction en cours d'execution. 

4. Processeur ultra-rapide selon les revendica- 
tions 1 et 3 caracterise en ce que ledit moyen 
de commande de decodage (12c) comprend 
de plus : 

- un moyen de commande d'adresses 
(12c2. 12c3), reli^ audit second moyen 
de traitement (10b) pour recevoir une 
information d*adresse depuis I'instruction 
memorisee dans ledit premier moyen 
(12a, 12d) et pour produire ledit second 
signal de sortie lorsque Tinstruction tom- 
be a rinterieur du premier sous-jeu dudit 
jeu d'instructions : 

- un moyen de comparaison d'adresses 
(12c4) reli^ h une entree dudit moyen de 
commande d'adresses (12c3) et a une 
sortie dudit moyen de commande 
d'adresses (12c2) pour comparer ledit 
second signal de sortie produit depuis le 
moyen de commande d'adresses avec 
I 'information d'adresse etant regu par le- 
dit moyen de commande d'adresses et 
pour produire un signal de chevauche- 
ment interdit lorsque I'information 
d'adresse associ^e audit second signal 
de sortie est identique a rinfonmation 
d'adresse etant regu par ledit moyen de 
commande d'adresses, ledit signal de 
chevauchement interdit empe chant le 
moyen de commande de decodage de 
predecoder la prochaine instruction me- 
morisee dans ledit premier moyen (12b); 
et 

- un tampon de recuperation special 
(12c5) pour repeter I'ex^cution d'une ins- 
truction. 



AnsprUche 

1. Hochgeschwindigkeltsprozessor zur Verwen- 
dung in einer EDV-Anlage. wobei der Hochge- 
schwindigkeltsprozessor Instruktionen eines 
Befehlsvorrates ausfUhrt, dessen erster Teilvor- 
rat einfacher in Hardware zu realisieren ist und 
haufiger als der ubrige Teilvorrat ausgefuhrt 
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wird, mit ersten Prozessormittein, die den er- 
sten Teilvorrat schnell ausfOhren. und mit zwei- 
ten Prozessormittein. die den Obrigen Teilvor- 
rat ausfUhren, gekennzelchnet durch 

5 

erste, in den ersten Prozessormittein (12) an- 
geordnete. Speicher (12a, 12b) zur Speiche- 
rung eines Befehls; 

Dekodierungssteuermittel (12c). die an die er- ro 
sten Speicher (12b) und an die zweiten Pro- 
zessormittel (10b) angeschlossen sind. urn den 
in den ersten Speichern gespeicherten Befehl 
zu dekodieren und urn festzustellen, ob der 
Betehl in den ersten Teilvonrat oder in den is 
Gbrigen Von^at fSllt. wobei die Dekodienjngs- 
steuermittel (12c) ein erstes Ausgangssignal 
erzeugen, wenn der Befehl zu dem ubrigen 
Teilvorrat gehort. und ein zweites Ausgangssi- 
gnal erzeugen, wenn der Befehl in den ersten 20 
Teilvorrat fa lit, und die zweiten Prozessormittel 
den Befehl als Antwort auf das erste, in den 
Dekodierungssteuermitteln (12c) erzeugte Aus- 
gangssignal, ausfuhren und als Antwort auf das 
zweite. in den Dekodierungssteuermitteln (12c) 25 
erzeugte Ausgangssignal. BefehisausfQhrung 
verhindern. 

2. Hochgeschwindigkeitsprozessor nach An- 
spruch 1, dadurch gekennzelchnet, 6sS die 00 
ersten Prozessonmittel (12) weiterhin umfas- 

sen: 

Mehrzweckregister (12d), die an die Dekodie- 
rungssteuermittel (12c) angeschlossen sind, 35 
um mindestens einen Operanden zu speichern 
und um ein fur den Operanden bezeichnendes 
Ausgangssignal als Antwort auf das zweite, in 
den Dekodierungssteuermittel (12c) erzeugte. 
Ausgangssignal zu erzeugen; 4o 

eine arithmetische und togische Einheit (12f), 
die an die Mehrzweckregister (12d) ange- 
schlossen ist und auf die in den Mehrzweckre- 
gistem (12d) erzeugten Ausgangssignale an- 45 
spricht. um eine Rechenoperation mit dem ei- 
nen Operand und einem zweiten Operand 
durchzufOhren und ein fOr die Ergebnisse der 
Rechenoperation bezeichnendes Ausgangssi- 
gnal zu erzeugen. so 

3. Hochgeschwindigkeitsprozessor nach An- 
spruch 1. dadurch gekennzelchnet daj3 

die ersten Prozessormittel (12) in iiberlappen- 55 
der Betriebsart arbeiten, wobei die Dekodie- 
njngssteuermittel (12c) der ersten Prozessor- 
mittel (12) einen in den ersten Mittein gespei- 



cherten Befehl vordekodieren. wMhrend die 
arithmetische und logische Einheit (12f) eine 
Rechenoperation mit den Operanden durch- 
fUhrt, die mit dem zur Zeit ausgefOhrten Befehl 
assoziiert sind. 

4. Hochgeschwindigkeitsprozessor nach An- 
spruch 1 und 3, dadurch gekennzelchnet. dai3 
die Dekodierungssteuermittel weiterhin umfas- 
sen: 

Adressensteuermittel (12c2; 12c3), die an die 
zweiten Prozessormittel (10b) angeschlossen 
sind, um eine Adresseninformatidn aus dem in 
den ersten Speichern (12a. I2b) gespeicherten 
Befehl zu empfangen und um ein zweites Aus- 
gangssignal zu erzeugen, wenn der Befehl in 
den ersten Teilvorrat des Befehlvorrates fallt; 

Adressenvergieichsmittel (12c4), die an einen 
Eingang der Adressensteuermittel (12c3) und 
an einen Ausgang der Adressensteuermittel 
(12c2) angeschlossen sind, um das zweite in 
den Adressensteuermittein erzeugte Ausgangs- 
signal mit der von den Adressensteuermittein 
empfartgenen Adresseninformation zu verglet- 
chen und ein Uberlappungssperrsigna! zu er- 
zeugen, wenn die mit dem zweiten Ausgangs- 
signal assoziierte Adresseninformation dieselbe 
ist, wie die von den Adressensteuermittein 
empfangene Adresseninformation, wobei das 
Uberiappungssperrsignal die Dekodierungs- 
steuermittel daran hindert, den nachsten in den 
ersten Speichern (12b) gespeicherten Befehl 
vorzudekodieren; und 

einen Sonderwiederholungspuffer (I2c5), um 
die AusfQhrung eines Befehls wiederzuholen. 
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